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This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING 

3 (1) General Information: ENTERED 

5 (i) APPLICANT: Inouye, Sumiko 

6 Hsu, Mei-Yin 

7 Eagle, Susan 

8 Inouye, Masayori 
9 

10 . (ii) TITLE OF INVENTION: PROKARYOTIC REVERSE TRANSCRIPTASE 
U 

12 (iii) NUMBER OF SEQUENCES: 52 
13 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: WEISER & ASSOCIATES 

16 (B) STREET: 230 South Fifteenth Street, Suite 500 

17 (C) CITY: Philadelphia 

18 (D) STATE: PA 

19 (E) COUNTRY: USA 

20 (F) ZIP: 19102 
21 

22 (V) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Floppy disk 

24 (B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC- DOS/MS- DOS 

26 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
27 

28 (vi) CURRENT APPLICATION DATA: 

29 (A) APPLICATION NUMBER: US 08/808,031 

30 (B) FILING DATE: 03-MAR-1997 

31 (C) CLASSIFICATION: 
32 

33 (viii) ATTORNEY/ AGENT INFORMATION: 

34 (A) NAME: Weiser, Gerard J. 

35 (B) REGISTRATION NUMBER: 19,763 

36 (C) REFERENCE/DOCKET NUMBER: 377 ( 91 3 ) . 5888P 
37 

38 (ix) TELECOMMUNICATION INFORMATION: 

39 (A) TELEPHONE: 215-875-8383 

40 (B) TELEFAX: 215-875-8394 
41 

42 

4 3 (2) INFORMATION FOR SEQ ID NO:l: 
44 

45 (i) SEQUENCE CHARACTERISTICS: 

46 (A) LENGTH: 2176 base pairs 
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RAW SEQUENCE LISTING DATE: 05/06/1999 

PATENT APPLICATION US/08/808,031A TIME: 08:25:12 

INPUT SET; S31745.mw 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 640.. 2094 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TCATCCGCGC GGACACCCCC TCCTACGTGC CCCCCGACGC GGAGAGCGGC GTGGAGACGG 60 

TGTACCGCGT TTCCCTGGAT GGTCACCTGG TGGCGGTGGA GTGGGGCCCG CGCACGGGCT 120 

CGCCGCGTCA CCAGCGGCTC TGGTTCGACT CGGATGCGGA AGCCCCCGGA GCCTACTTCG 180 

CGCGCCTCGA GAAGTTGGCG GCTGACGGCT ACATCGACGC GGCCTCGGCA TTGGTCTAAA 240 

CCCTTCAACC ACGGCTCGGC CGCCACGCGC GGCCGGCAGG ACAGGTGCGA CGAACAGACG 300 

ACGACGTGCG CTTCACGCGC GAGCAGCCGA GAGAGGTCCG GAGTGCATCA GCCTGAGCGC 360 

CTCGAGCGGC GGAGCGGCGT TGCGCCGCTC CGGTTGGAAT GCAGGACACT CTCCGCAAGG 420 

TAGCCTGTTC TTGGCTCTCT CCCTCCTAGG CACTACGGCC AGGGTGGGTA GCGGAGCCAA 480 

CGACGCCACC GCCGTTTACC CACCCCGGCC GTAGTGCCTA GGAGGGGAGA GCCGGTGAGG 540 

CTACCGTGCC CCAGGTAAGA TGGTGGTGCT TTCCCGGCCT CCGTCGACTG CTCGCGCCAT 600 

GTCCCGTCTT CCATCGCCGC GCCCGCCCAA GGTGCAGAC ATG ACC GCC AGG CTG 654 

Met Thr Ala Arg Leu 
1 5 

GAC CCG TTC GTC CCC GCA GCT TCG CCG CAG GCC GTG CCC ACG CCC GAG 702 
Asp Pro Phe Val Pro Ala Ala Ser Pro Gin Ala Val Pro Thr Pro Glu 
10 15 20 



CTC ACC GCT CCG TCG TCA GAC GCG GCC GCG AAG CGT GAA GCC CGC CGG 750 
Leu Thr Ala Pro Ser Ser Asp Ala Ala Ala Lys Arg Glu Ala Arg Arg 
25 30 35 



CTC GCG CAC GAA GCG TTG CTC GTC CGC GCG AAG GCC ATC GAC GAA GCG 7 98 

Leu Ala His Glu Ala Leu Leu Val Arg Ala Lys Ala lie Asp Glu Ala 
40 45 50 



GGC GGC GCC GAC GAC TGG GTG CAG GCG CAG CTC GTC TCC AAG GGG CTC 846 
Gly Gly Ala Asp Asp Trp Val Gin Ala Gin Leu Val Ser Lys Gly Leu 
55 60 65 
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Lys 


Arg 
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GCG 


114 


Gly Ala 


115 






i i c 

1 X O - 
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1 1 Q 
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Val 


Pro 


119 ~ 
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1 1 ft 
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lzz 


Asp 


Ser 


12 3 
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1 ZD 


Leu 


Arg 


127 






1 zo 
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129 


GTG 


AGC 


1 1 A 
1 3U 


Val 


Ser 


TOT 
131 
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134 


Ser 


Pro 


135 
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1 "5 ft 
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INPUT SET: S3174S.raw 
\C AAG AAG 894 



75 



90 



80 35 



95 100 



105 HO US 

GGC GTG CAC TGG GCG GAG GAC CGC CTG GCC GAC GCG TTC GAC 

Gly Val His Trp Ala Glu Asp Arg Leu Ala Asp Ala Phe Asp 

120 125 130 

CAC CGC GAG GAG CGC GCC CGG GCC AAC GGC CTG ACG GAG CTG 

His Arg Glu Glu Arg Ala Arg Ala Asn Gly Leu Thr Glu Leu 

140 145 

GCG GAG GCG CTG GCC AAG GCG CTG GGG CTG AGC GTC TCC AAG 

Ala Glu Ala Leu Ala Lys Ala Leu Gly Leu Ser Val Ser Lys 

155 160 165 

TGG TTC GCG TTC CAC CGG GAG GTC GAC ACG GCC ACG CAC TAC 
Trp Phe Ala Phe His Arg Glu Val Asp Thr Ala Thr His Tyr 

170 175 180 



185 



190 195 



200 205 210 



220 225 



235 240 245 



250 



265 



255 260 



270 275 



942 



990 



1038 



1086 



1134 



1182 



1230 



1278 



1326 



1374 



1422 



1470 



PAGE- 4 RAW SEQUENCE LISTING DATE: 05/06/1999 

PATENT APPLICATION US/08/808,031A TIME: 08:25: 12 

INPUT SET: S31745.mw 

15 3 ACG CTG CTG TCC CTC CTC TCC ACG GAA GCG CCG CGG GAG GCG GTC CAG 1518 

154 Thr Leu Leu Ser Leu Leu Ser Thr Glu Ala Pro Arg Glu Ala Val Gin 

155 280 285 290 
156 

157 TTC CGC GGC AAG CTC CTG CAC GTC GCC AAG GGC CCG CGC GCC CTG CCC 1566 

158 Phe Arg Gly Lys Leu Leu His Val Ala Lys Gly Pro Arg Ala Leu Pro 

159 295 300 305 
160 

161 CAG GGC GCC CCC ACG TCG CCC GGC ATC ACC AAC GCG CTC TGC CTG AAG 1614 

162 Gin Gly Ala Pro Thr Ser Pro Gly lie Thr Asn Ala Leu Cys Leu Lys 

163 310 315 320 325 
164 

165 CTC GAC AAG CGG CTG TCC GCC CTC GCG AAG CGG CTG GGC TTC ACC TAC 1662 

166 Leu Asp Lys Arg Leu Ser Ala Leu Ala Lys Arg Leu Gly Phe Thr Tyr 

167 330 335 340 
168 

169 ACG CGC TAC GCG GAC GAC CTG ACC TTC TCC TGG ACG AAG GCG AAG CAG 1710 

170 Thr Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp Thr Lys Ala Lys Gin 

171 345 350 355 
172 

173 CCC AAG CCG CGG CGG ACG CAG CGT CCC CCC GTC GCG GTC CTC CTG TCT 1758 

174 Pro Lys Pro Arg Arg Thr Gin Arg Pro Pro Val Ala Val Leu Leu Ser 

175 360 365 370 
176 

177 CGC GTC CAG GAA GTG GTG GAG GCG GAG GGC TTC CGC GTG CAC CCG GAC 1806 

178 Arg Val Gin Glu Val Val Glu Ala Glu Gly Phe Arg Val His Pro Asp 

179 375 380 385 
180 

181 AAG ACG CGC GTC GCC CGC AAG GGC ACG CGG CAG CGG GTC ACC GGG CTC 1854 

182 Lys Thr Arg Val Ala Arg Lys Gly Thr Arg Gin Arg Val Thr Gly Leu 

183 390 395 400 405 
184 

185 GTC GTG AAT GCG GCG GGC AAG GAC GCG CCC GCG GCC CGA GTC CCG CGC 1902 

186 Val Val Asn Ala Ala Gly Lys Asp Ala Pro Ala Ala Arg Val Pro Arg 

187 410 415 420 
188 

189 GAC GTC GTC CGC CAG CTC CGC GCC GCC ATC CAC AAC CGG AAG AAG GGC 1950 

190 Asp Val Val Arg Gin Leu Arg Ala Ala lie His Asn Arg Lys Lys Gly 

191 425 430 435 
192 

193 AAG CCG GGC CGC GAG GGC GAG TCG CTC GAG CAG CTC AAG GGC ATG GCC 1998 

194 Lys Pro Gly Arg Glu Gly Glu Ser Leu Glu Gin Leu Lys Gly Met Ala 

195 440 445 450 
196 

197 GCC TTC ATC CAC ATG ACG GAC CCG GCC AAG GGC CGC GCC TTC CTG GCT 2046 

198 Ala Phe lie His Met Thr Asp Pro Ala Lys Gly Arg Ala Phe Leu Ala 

199 455 460 465 
200 

201 CAG CTC ACG GAG CTC GAG TCC ACG GCG AGC GCC GCT CCG CAG GCG GAG 2094 

202 Gin Leu Thr Glu Leu Glu Ser Thr Ala Ser Ala Ala Pro Gin Ala Glu 

203 470 475 480 485 
204 

205 TGACGCTCAG CGCGCGTCCG TCGCCGACGT GCCGCGCGCC AGCAACGCCG CATTCAGCAA 2154 



PAGE: 5 RAW SEQUENCE LISTING DATE: 05/06/1999 

PATENT APPLICATION US/08/808,031A TIME: 08:25:13 

INPUT SET: S31745.mw 

206 

207 CTCCGTCAGC CGGCGCGGGT AC 2176 

208 

209 

210 (2) INFORMATION FOR SEQ ID NO: 2: 
211 

212 (i) SEQUENCE CHARACTERISTICS: 

213 (A) LENGTH: 485 amino acids 

214 (B) TYPE: amino acid 

215 (D) TOPOLOGY: linear 
216 

217 (ii) MOLECULE TYPE: protein 

218 

219 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

220 

221 Met Thr Ala Arg Leu Asp Pro Phe Val Pro Ala Ala Ser Pro Gin Ala 

222 15 10 15 
223 

224 Val Pro Thr Pro Glu Leu Thr Ala Pro Ser Ser Asp Ala Ala Ala Lys 

225 20 25 30 
226 

227 Arg Glu Ala Arg Arg Leu Ala His Glu Ala Leu Leu Val Arg Ala Lys 

228 35 40 45 
229 

2 30 Ala lie Asp Glu Ala Gly Gly Ala Asp Asp Trp Val Gin Ala Gin Leu 

231 50 55 60 

232 

233 Val Ser Lys Gly Leu Ala Val Glu Asp Leu Asp Phe Ser Ser Ala Ser 

234 65 70 75 80 
235 

236 Glu Lys Asp Lys Lys Ala Trp Lys Glu Lys Lys Lys Ala Glu Ala Thr 

237 85 90 95 
2 38 

239 Glu Arg Arg Ala Leu Lys Arg Gin Ala His Glu Ala Trp Lys Ala Thr 

240 100 105 110 
241 

242 His Val Gly His Leu Gly Ala Gly Val His Trp Ala Glu Asp Arg Leu 

243 115 120 125 
244 

245 Ala Asp Ala Phe Asp Val Pro His Arg Glu Glu Arg Ala Arg Ala Asn 

246 130 135 140 
247 

248 Gly Leu Thr Glu Leu Asp Ser Ala Glu Ala Leu Ala Lys Ala Leu Gly 

249 145 150 155 160 
250 

251 Leu Ser Val Ser Lys Leu Arg Trp Phe Ala Phe His Arg Glu Val Asp 

252 165 170 175 
253 

254 Thr Ala Thr His Tyr Val Ser Trp Thr lie Pro Lys Arg Asp Gly Ser 

255 180 185 190 
256 

257 Lys Arg Thr lie Thr Ser Pro Lys Pro Glu Leu Lys Ala Ala Gin Arg 

258 195 200 205 



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 05/06/1999 

PATENT APPLICATION US/08/808,031 A TIME: 08:25:13 

INPUT SET: S3174S.mw 
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